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BLAST Basic Local Alignment Search Tool 

Job Title: BD20521 2: Nucleotide sequence for detecting... 

Please, try .pur new design! 
BLASTN 2.2.18+ 

Reference: Stephen F. Altschul, Thomas L . Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997}, 
"Gapped BLAST and PSI-BLA5T: a new generation of protein database search 
programs", Nucleic Acids Res. 25:338 9-3402. RID: 88NSGD54014 Database: All 
GenBank+EMBL+DDBJ+PDB sequences (but no EST, STS, GSS, environmental samples 
or phase 0, 1 or 2 HTGS sequences) 7,064,549 sequences; 24,305,219,031 
total letters 

Qu©ry= gi I 33014982 | dbj | BD205212 . 1 1 Nucleotide sequence for detecting 
enterohemorrhagic Escherichia coli (EHEC) Length=1181 
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Sequences 

(Click hea 
AF401292.1 

AF674613.r 

Yl 12 75.1 
01154972 

^043470 

Cp'6b0035™i"l" 

cpoKosiTi™' 
cpo6i6627i" 

7^34 8 7 0 6 . i 
M8793427r 

"cp666o37.i 
cp6ob639.i 

3D11025il 

cpoooT997i 

CP666795 i 

AFl34403Vr'' 



CR942285.1 
CP 00 6800.1' 



producing significant alignments: 

ders to sort columns) 
Escherichia coli 0157 :H- plasmid 
PSF0157, complete sequence 

' Esc'hie r i c'n i a" coli*" 015 7 *: H 7 pi a sm id "p 0 1 5 1 ] 
complete sequence 



2111 2111 100% 0.0 99% 
2 111 211 1 100%' ' 0 ' 76 99% ' 



E.coli 7.4 kb DNA from plasmid p0157 

Escherichia coii 0157 :H7 str. Sakai 
plasmid p0157 DNA, complete sequence 

Escherichia coli plasmid p0157 ecf4 
gene, partial cds; and ecf3, ecf2, and 
ecfl genes, complete cds 

Sh ig el la d y s e n t e r i a e S d 197 "plasmid 
pSDl_197, complete sequence 

Escherichia coli 53638 plasmid 
p53638_226, complete sequence 



2111 

"2111" 



"2111 100% 0.6 99% [cj 
2111 100% "6. 6 "99% 



"2102 2102 100% 0.0 99% 



13 



Shigella boydii CDC 3083-94 plasmid 
pBS512_211, complete sequence 

Shigella flexneri 2a str. 301 virulence 
plasmid pCP301, complete sequence 



Shigella flexneri plasmid pINV_F6_M1382 
0RF186 (0RF186), VirK {virK) , and MsbB2 
(msbB2) genes, complete cds 

S H i g e i la flex rier i vi r u le nee ' pi a smi d 
pWRlOO: from 1 to 213494 

Shigella flexneri 5a plasmid virulence 
plasmid pWRSOl, complete sequence 



Shigella flexneri plasmid pSF5, 
complete sequence 

'Shrgeiia''boyd 
complete sequence 

Shigella sonnei Ss04 6 plasmid pSS_04 6, 
complete sequence 

Shigella f 1 exne r i " pi a "s" mi d pMYSH6060 
virK gene for virulence protein, 
complete cds 

Escherichia coli E24377A plasmid 
pETEC_7 4, complete sequence 

Escherichia coli E24377A plasmid 
pETEC_80, complete sequence 

Escherichia coli plasmid p086Ai DNA, 
complete sequence 



Escherichia coli plasmid pAA2 Shf 
(shf), hexosyltransferase homolog 

(capU) , and VirK (virK) genes, complete 
cds 

Escherichia coli plasmid pCoo 

Escherichia coli E24'377A7 complete 
genome 



138 

132' 
"132 " 

132 

132' 

132" 

132 

132' 
132 

111 

"iii' 

111" 

....... 



111 

"10 7" 



138 24% 6e-29 71% 
132 24% -3^27 71%' 



132 24% 3e-27 71% 



132 24% 3e-27 71% 



132 24% 3e-27 71% 



132 24% 3e-27 71% 



132 24% 3e-27 71% 



132 24% 3e-27 71% 
132 "24% 3e-27 71% 



132 24% 3e-27 71% 



132 24%' 3e-27 71% 



111 17% 9e-217H 



111 17% 9e-2171% 



111 17% 9e-21 71% 



111 17% 9e-21 71% 



11.1" 17% ' 9e-21 71% 
107 24% le-19 69%" 



AC133908.7 



46.4 46.4 2% 0.30 93% 
AC127327.4 Mus musculus BAC clone RP23-254M18 f rom "4 674'" 467 4 2%" 0" 30 93%"' 



Mus musculus chromosome 5, clone RP24- 
299L9, complete sequence 
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5, complete sequence 



44.6 44.6 2% 



(HoxCl3aa) , 
(HoxCl2aa) , 
(HoxCllaa) , 
(HoxClOaa) , 



EU025714.1 Sa'imo salar retinoic acid receptor 
gamma a (Rarga), coiled-coil 
transcriptional coactivator a 
(Kiaal536), homeobox protein HoxC13aa 
horaeobox protein HoxC12aa 
homeobox protein HoxCllaa 
homeobox protein HoxClOaa 
homeobox protein HoxC9aa 
(HoxC9aa), homeobox protein HoxC8aa 
(HoxC8aa), and homeobox protein HoxC6aa 
(HoxC6aa) genes, complete cds 

Photorhabdus luminescens subsp . 44.6 44.6 2% 

laumondii TT01 complete genome; segment 
3/17 

Operophtera brumata reovirus segment 9, 42.8 42.8 2% 
complete sequence 



1.1 93* 



BX571861.1 

DQ19'2243'JI * 
AC145866. 3 
M615928.1 
AC004844.1 



i.T ibo% 

3.7 92% jjjtj 

3.7 90 % 
'3.7 16 0% 

^i/f' ' '90% £jj|'' 



Pan troglodytes BAG clone RP43-21B7 
from chromosome 7, complete sequence 

Bacteroides thetaiotaomicron VPI-5482, 42.8 
complete genome 



42.8 42.8 2% 



42.8 1% 



Homo sapiens PAC clone RP4-613I23 from 42.8 
7pll-pl3, complete sequence 



42,8 2% 



ir\ i—i ir\ r\ r\ r\ 



NCBI Blast:BD205212:Nucleotide sequence for detecting.. 
Alignments 



Page 4 sur 1 7 



>gb|AF401292.1| 

Length-121239 



03 Escherichia coii 0157 : H- plasmid pSF0157, complete sequence 



Score = 2111 bits (2340), Expect - 0.0 
Identities - 1178/1182 (99%), Gaps = 1/1182 (0%) 
St rand^Plus /Minus 



Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 



1 

29247 
60 

29187 

120 

29127 

180 

29067 

240 

29007 

300 

28947 

360 

28887 

420 

28827 

480 

28767 

540 

28707 

600 

28647 

660 

28587 

720 

28527 

780 

28467 

840 

28407 

900 

28347 

960 



CTGCAGGAGA-TGG-v" ->- >;; ;: •. ^.TTGCCCATCCCAGCGCGCTCCAGCT 

I I I I I I I I II II I I I I 1 I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I t I I I I I M 
CTGCAGGAGAGTGGAAAAAAAGCCAAAATAAAAAATTGCCCATCCCAGCGCGCTCCAGCT 



59 
29 



GAAAGTAGGCCTGTTCTGTCCGGTATTTAAATGCATTGACCGTCCCCGTATTTAAACAAT 11 

I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! i I I I I ! I I I I I I I I I I I I I I I I 

GAAAGTAGGCCTGTTCTGTCCGGTATTTAAATGCATTGACCGTCCCCGTATTTAAACAAT 2 9 

GTGATAAATTACTCCGTTACCGGAAAACCGCTGAACAAAATTCGGGCTGAAAAGAGGATC 17 

I I i I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I M I I I I I I I I I I I I I I I I M I I I I I 

GTGATAAATTACTCCGTTACCGGAAAACCGCTGAACAAAATTCGGGCTGAAAAGAGGATC 2 9 

CGCCGTTATCTGTTGCATTTCCCCTTAGCCTGACTAGCCAGAGACACAATGATCTGTGCC 23 
I I I i I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 

CGCCGTTATCTGTTGCATTTCCCCTTAGCCTGACTAGCCAGAGACACAATGATCTGTGCC 29 

GTTCTGTTAATATCAAACCGGTACTCAATATCTTCTCTGGCGCTGGCTGCCATCATCCGG 29 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

GTTCTGTTAATATCAAACCGGTACTCAATATCTTCTCTGGCGCTGGCTGCCATCATCCGG 28 

AAGCGTTCCGGTCGGGATAAAAAATCGCGCAGTGCGCCGGTCCATGCAGACACATCCCCC 35 

I I I I I I II I I E I I I I I I I I i I I I I I I I I I I I I I I I II I I I I I I I I I I I I 1 I I I M I I M I 
AAGCGTTCCGGTCGGGATAAAAAATCGCGCAGTGCGCCGGTCCATGCAGACACATCCCCC 28 

ACGGGTAACAGCGTCCCTGTCACATTCTTCTGAATGACATCAGGGATCCCGCCCGTCTCA 41 

II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I IE I I I I I I I I I 
ACGGGTAACAGCGTCCCTGTCACATTCTTCTGAATGACATCAGGGATCCCGCCCGTCTCA 2 8 

CTGGCGATAACGGGCACGCCGGAGACTGACGCTTCAGCCAGTACCATACCAAACGCTTCA 4 7 

I I I I I I I I I M II I I I I II I I I I I I I I I I I I I M I I I I I I I I I I I M II I I I I I I I I I M 

CTGGCGATAACGGGCACGCCGGAGACTGACGCTTCAGCCAGTACCATACCAAACGCTTCA 28 

TTTTCCGAAGGCATGACCACCACACTGGCAATCCGGTAGACCGGTAACGCTGGGAAAAGG 53 
I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

TTTTCCGAAGGCATGACCACCACACTGGCAATCCGGTAGACCGGTAACGCTGGGAAAAGG 28 

GCACCTGCCATTAACACATCTCCGCTCATTCCCAGGTGTTCTGTCTGCTGACGCAGACGT 59 

M I II I I I I I I II I I I I M I I I I I I M I I I I I I I I I I II I I I I I I I I I I I I I I I I M I I I 

GCACCTGCCATTAACACATCTCCGCTCATTCCCAGGTGTTCTGTCTGCTGACGCAGACGT 28 

GCTTCGTATTCTTCACGCCCGGCGCCCACCACGAGCCAGCGAAATGATTTCCCTTCCATC 65 

I I I I I I I I II I I I I II I I I I I t I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I M I 

GCTTCGTATTCTTCACGCCCGGCGCCCACCACGAGCCAGCGA7VATGATTTCCCTTCCATC 2 8 

TTCAGCTGATACAATACACGCAGCATAAATTCATGTCCTTTTTCGGGACGTAGCATCCCC 71 

I I I I II I I I I I I I I I I I I I I I I I I I I I M I I I I I I I M I I I I I I I I I I I I I I I I 1 I I I I I 

TTCAGCTGATACAATACACGCAGCATAAATTCATGTCCTTTTTCGGGACGTAGCATCCCC 28 

ACCTGAACGATAAGCGGAACATTGTCTGCTGATGCAGCCCAGGCGTGGATATGCAGGGGT 7 7 

M I I I I I I I I I I I I I I M M I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I M II I I I I I I 

ACCTGAACGATAAGCGGAACATTGTCTGCTGATGCAGCCCAGGCGTGGATATGCAGGGGT 28 

AACGGTCGCATGGCTTCATTATGCAATGCGGGCCAGTCGAAACCCGGTGGAATAACCGTT 8 3 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I II I I II I I I I I i I I 

AACGGTCGCATGGCTTCATTATGCAATGCGGGCCAGTCGAAACCCGGTGGAATAACCGTT 28 

ACCGGTGTCCTGACACCTTCCGCCATCAGATGCGCCATCATGGGTGAGATAGGCACAACA 8 9 

I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 

ACCGGTGTCCTGACACCTTCCGCCATCAGATGCGCCATCATGGCTGAGCTAGGCACAACA 28 

ATGAAATCACACAGATAATTCAGGGAAAACGTTCTGGTCTTACGGGTGATGTAGGTTTTT 95 

I I I I I II I I I I II M I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

ATGAAATCACACAGATAATTCAGGGAAAACGTTCTGGTCTTACGGGTGATGTAGGTTTTT 28 

TGTCTGACAATAGTGAAGCGGTGACAGCATATCAGACGGCTCAGTCCTGCTATATTACTG 10 
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Sbjct 


28287 


Query 


1020 


Sbjct 


28227 


Query 


1080 


Sbjct 


28167 


Query 


1140 


Sbjct 


28107 



I I I I I I I I I I I I I I I I I M II I I I I I j I I I I I I I I I I I I M II I I I I I I I I I ! I I I 11 I 
TGTCTGACAATACTGAAGCGGTGACAGCATATCAGACGGCTCAGTCCTGCTATATTACTG 28 

TCATGGCCACTATGGCAGATGACCAGATCAGGTTTAAATTCCCCGATAATCCGTCGAAGT 10 

I I I I I I i I I I I I I I I I I I I I! I I I I I I I M M I I I I I I I I li I I I I I II I I I I I I I II II 
TCATGGCCACTATGGCAGATGACCAGATCAGGTTTAAATTCCCCGATAATCCGTCGAAQT 20 

CTGAGGATGGAAGGAAGGTGAAGGCTGTTCCTGAAAGGAATAAAAGTGACATCATGCCCT 11 
I I M I M M I I I I I I I I i I I I II I I I I I I I I I I I I I I I I I I I I i I I M I 1 I I I I I I I I I I 
CTGAGGATGGAAGGAAGGTGAAGGCTGTTCCTGAAAGGAATAAAAGTGACATCATGCCCT 28 

CTTTTTCTGGCTTCCGGAGCAATTTTACTTTTTTCTCTGCAG 1181 
I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I II M 
CTTTTTCTGGCTTCCGGAGCAATTTTACTTTTTTCTCTGCAG 28066 



>gb | AF074 613 . 1 1 13 Escherichia coli 0157 :H7 plasmid p0157, complete sequence 
Length=92077 



Score - 2111 bits (2340), Expect = 0.0 
Identities = 1178/1182 (99%), Gaps = 1/1182 (0%) 
Strand=Plus /Minus 



Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 



1 

19435 
60 

19375 

120 

19315 

180 

19255 

240 

19195 

300 

19135 

360 

19075 

420 

19015 

480 

18955 

540 

18895 

600 

18835 

660 

18775 

720 

18715 

780 

18655 

840 



CTGCAGGAGA-TGG:.! /.< CjC .. :••;.>. a *: aaaa.;^ TTGCCCATCCCAGCGCGCTCCAGCT 

I I I I I I I I II I II I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I 1 I I I I I t I I I I I 
CTGCAGGAGAGTGGAAAAAAAGCCAAAATAAAAAATTGCCCATCCCAGCGCGCTCCAGCT 

GAAAGTAGGCCTGTTCTGTCCGGTATTTAAATGCATTGACCGTCCCCGTATTTAAACAAT 
I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 
GAAAGTAGGCCTGTTCTGTCCGGTATTTAAATGCATTGACCGTCCCCGTATTTAAACAAT 

GTGATAAATTACTCCGTTACCGGAAAACCGCTGAACAAAATTCGGGCTGAAAAGAGGATC 
I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I 
GTGATAAATTACTCCGTTACCGGAAAACCGCTGAACAAAATTCGGGCTGAAAAGAGGATC 

CGCCGTTATCTGTTGCATTTCCCCTTAGCCTGACTAGCCAGAGACACAATGATCTGTGCC 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M II I I I I M I I I I I I I I I I I I I I I I I 
CGCCGTTATCTGTTGCATTTCCCCTTAGCCTGACTAGCCAGAGACACAATGATCTGTGCC 

GTTCTGTTAATATCAAACCGGTACTCAATATCTTCTCTGGCGCTGGCTGCCATCATCCGG 

I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I i I I I I 
GTTCTGTTAATATCAAACCGGTACTCAATATCTTCTCTGGCGCTGGCTGCCATCATCCGG 



59 
19 
1 1. 
19 
17 
19 
23 
19 
29 
19 
35 



AAGCGTTCCGGTCGGGATAAAAAATCGCGCAGTGCGCCGGTCCATGCAGACACATCCCCC 
I I I I I I I I I I I I I I I I I I I I I I I I I I I 11 I I I I I I I I I 1 I I I I I I I I I I II I I I I I I M I 
AAGCGTTCCGGTCGGGATAAAAAATCGCGCAGTGCGCCGGTCCATGCAGACACATCCCCC 19 

ACGGGTAACAGCGTCCCTGTCACATTCTTCTGAATGACATCAGGGATCCCGCCCGTCTCA 41 
I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 11 M I I I I I I I I I 
ACGGGTAACAGCGTCCCTGTCACATTCTTCTGAATGACATCAGGGATCCCGCCCGTCTCA 19 

CTGGCGATAACGGGCACGCCGGAGACTGACGCTTCAGCCAGTACCATACCAAACGCTTCA 4 7 

I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I 
CTGGCGATAACGGGCACGCCGGAGACTGACGCTTCAGCCAGTACCATACCAAACGCTTCA 18 

TTTTCCGAAGGCATGACCACCACACTGGCAATCCGGTAGACCGGTAACGCTGGGAAAAGG 53 

I I I I I I 1 I I I I I I I I I I I I I I I I I I I I II II I I I I I II I I I I II I I I II I ! I I I I I I I I I 
TTTTCCGAAGGCATGACCACCACACTGGCAATCCGGTAGACCGGTAACGCTGGGAAAAGG 18 

GCACCTGCCATTAACACATCTCCGCTCATTCCCAGGTGTTCTGTCTGCTGACGCAGACGT 59 
i I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
GCACCTGCCATTAACACATCTCCGCTCATTCCCAGGTGTTCTGTCTGCTGACGCAGACGT 18 

GCTTCGTATTCTTCACGCCCGGCGCCCACCACGAGCCAGCGAAATGATTTCCCTTCCATC 65 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I II I I I I I I 
GCTTCGTATTCTTCACGCCCGGCGCCCACCACGAGCCAGCGAAATGATTTCCCTTCCATC 18 

TTCAGCTGATACAATACACGCAGCATAAATTCATGTCCTTTTTCGGGACGTAGCATCCCC 71 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I 
TTCAGCTGATACAATACACGCAGCATAAATTCATGTCCTTTTTCGGGACGTAGCATCCCC 18 

ACCTGAACGATAAGCGGAACATTGTCTGCTGATGCAGCCCAGGCGTGGATATGCAGGGGT 77 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I 
ACCTGAACGATAAGCGGAACATTGTCTGCTGATGCAGCCCAGGCGTGGATATGCAGGGGT 18 

AACGGTCGCATGGCTTCATTATGCAATGCGGGCCAGTCGAAACCCGGTGGAATAACCGTT 8 3 
I I I I I I I I I I I i I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I M I I I I I I I 
AACGGTCGCATGGCTTCATTATGCAATGCGGGCCAGTCGAAACCCGGTGGAATAACCGTT 18 

ACCGGTGTCCTGACACCTTCCGCCATCAGATGCGCCATCATGGGTGAGATAGGCACAACA 8 9 
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Sbjct 


13595 


Query 


900 


Sbjct 


18535 


Query 


960 


Sbjct 


18475 


Query 


1020 


Sbjct 


18415 


Query 


1080 


Sbjct 


18355 


Query 


1140 


Sbjct 


18295 



I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I II I I I I I I I I I I I ! I I I 
ACCGGTGTCCTGACACCTTOCGCCATCAGATGCGCCATCATGGCTGAGCTAGGCACAACA 18 

ATGAAATCACACAGATAATTCAGGGAAAACGTTCTGGTCTTACGGGTGATGTAGGTTTTT 95 

I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I M I I f I I 1 _ 

ATGAAATCACACAGATAATTCAGGGAAAACGTTCTGGTCTTACGGGTGATGTAGGTTTTT 18 

TGTCTGACAATAGTGAAGCGGTGACAGCATATCAGACGGCTCAGTCCTGCTATATTACTG 10 

If I I I I I I I M I M I I I E I I I M I I I I I I I II I I I I I M I I I I I I I I I I I I M I I I I I I 
TGTCTGACAATACTGAAGCGGTGACAGCATATCAGACGGCTCAGTCCTGCTATATTACTG 18 

TCATGGCCACTATGGCAGATGACCAGATCAGGTTTAAATTCCCCGATAATCCGTCGAAGT 10 

I I i I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I M I I I I I I M I I I I I II I I I I I 
TCATGGCCACTATGGCAGATGACCAGATCAGGTTTAAATTCCCCGATAATCCGTCGAAGT 18 

CTGAGGATGGAAGGAAGGTGAAGGCTGTTCCTGAAAGGAATAAAAGTGACATCATGCCCT 11 

I I I I I I I I ! I I I I I I I I I I I ! I I I I I I I I II I ! I It I I I I I I I I I I I I I I ! I II I I I I I I 
CTGAGGATGGAAGGAAGGTGAAGGCTGTTCCTGAAAGGAATAAAAGTGACATCATGCCCT 18 

CTTTTTCTGGCTTCCGGAGCAATTTTACTTTTTTCTCTGCAG 1181 
i I I I I I I i M I M I I I I I I I I I I I I I 1 I M I I I I II I I I M I 
CTTTTTCTGGCTTCCGG AGCAATTTTACTTTTTTCTCTGCAG 18254 

>emb|Y11275.1|EC74P0157 E E.coli 7.4 kb DNA from plasmid p0157 
Length=7395 

Score = 2111 bits (2340), Expect = 0.0 
Identities - 1178/1182 (99%), Gaps = 1/1182 (0%) 
St rand=Plus /Minus 

CTGCAGGAGA-TGG j /■< h ■■<, . > .-.< i ,: ., -< ... i :< • : ... . s ,< h * TTGCCC ATCCCAGCGCGCTCCAGCT 59 
II II I I i I I I IE I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I i I I ! I I I I I i I i ! 
CTGCAGGAGAGTGGAAAAAAAGCCAAAATAAAAAATTGCCCATCCCAGCGCGCTCCAGCT 282 

GAAAGT AGGCCTGTTCTGTCCGGTATTTAAATGCATTGACCGTCCCCGTATTTAAACAAT 119 
I I I II I I I I 1 I I I I I I I I I I I I I f M I I I I I I I M I I f I I I I I I I I I I I I I I I I I I I I II 
GAAAGTAGGCCTGTTCTGTCCGGTATTTAAATGCATTGACCGTCCCCGT ATTTAAAC AAT 27 6 

GTGAT AAATTACTCCGTTACCGGAAAACCGCTGAAC AAAATTCGGGCTGAAAAGAGGATC 179 

I I I I I I M I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I II 
GT GAT AAATTACTCCGTTACCGGAAAACCGCTGAAC AAAATTCGGGCTGAAAAGAGGATC 270 

CGCCGTTATCTGTTGCATTTCCCCTTAGCCTGACTAGCCAGAGACACAATGATCTGTGCC 239 
I II I I I I I I II I I I I I I II I I M I I I I I M I I I I I I I I I I E I I I I I I i I I I I I I I I I I I I 
CGCCGTTATCTGTTGCATTTCCCCTTAGCCTGACTAGCCAGAGACACAATGATCTGTGCC 264 

GTTCTGTTAATATCAAACCGGTACTCAATATCTTCTCTGGCGCTGGCTGCCATCATCCGG 299 

I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 
GTTCTGTTAATATCAAlACCGGTACTCAATATCTTCTCTGGCGCTGGCTGCCATCATCCGG 253 

AAGCGTTCCGGTCGGGATAAAAAATCGCGCAGTGCGCCGGTCCATGCAGACACATCCCCC 359 
I I I I I I I I I I I I i I I I I I I I I I I I I I I I II II I I I I I I I I II I I II I I I I I 1 I I I I I I I I 
AAGCGTTCCGGTCGGGATAAAAAATCGCGCAGTGCGCCGGTCCATGCAGACACATCCCCC 252 

ACGGGTAACAGCGTCCCTGTCACATTCTTCTGAATGACATCAGGGATCCCGCCCGTCTCA 419 
I II I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I II I I I I I I I I I I I I I I I I II 
ACGGGTAACAGCGTCCCTGTCACATTCTTCTGAATGACATCAGGGATCCCGCCCGTCTCA 24 6 

CTGGCGATAACGGGCACGCCGGAGACTGACGCTTCAGCCAGTACCATACCAAACGCTTCA 47 9 
I I M I I I I I I I I I I I I II I I I I I I I I M I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
CTGGCGATAACGGGCACGCCGGAGACTGACGCTTC AGCCAGTACCATACCAAACGCTTCA 240 

TTTTCCGAAGGCATGACCACCACACTGGCAATCCGGTAGACCGGTAACGCTGGGAAAAGG 539 
I I I I I M I I I I I M M I I I ! I I I I I I I M I I M I I II I II I I I I I I I I II I I I I I I I I I I 
TTTTCCGAAGGCATGACCACCACACTGGCAATCCGGTAGACCGGTAACGCTGGGAAAAGG 234 

GCACCTGCCATTAACACATCTCCGCTCATTCCCAGGTGTTCTGTCTGCTGACGCAGACGT 599 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I II I I I I I I I I I I I I II 
GCACCTGCCATTAACACATCTCCGCTCATTCCCAGGTGTTCTGTCTGCTGACGCAGACGT 228 

GCTTCGTATTCTTCACGCCCGGCGCCCACCACGAGCCAGCGAAATGATTTCCCTTCCATC 659 
I I I I II I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GCTTCGTATTCTTCACGCCCGGCGCCCACCACGAGCCAGCGAAATGATTTCCCTTCCATC 222 

TTCAGCTGATACAATACACGCAGCATAAATTCATGTCCTTTTTCGGGACGTAGCATCCCC 719 
I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I M I I I I i II I I I I I II I I I I I M II I I M 
TTCAGCTGATACAATACACGCAGCATAAATTCATGTCCTTTTTCGGGACGTAGCATCCCC 216 

ACCTGAACGATAAGCGGAACATTGTCTGCTGATGCAGCCCAGGCGTGGATATGCAGGGGT 7 7 9 
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I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I ! M I I I I I I I I I I I I I 
ACCTGAACGATAAGCGGAACATTGTCTGCTGATGCAGCCCAGGCGTGGATATGCAGGGQT 210 

AACGGTCGCATGGCTTCATTATGCAATGCGGGCCAGTCGAAACCCGGTGGAATAACCGTT 839 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I ! II M I M I I I I I 
AACGGTCGCATGGCTTCATTATGCAATGCGGGCCAGTCGAAACCCGGTGGAATAACCGTT 204 

ACCGGTGTCCTGACACCTTCCGCCATCAGATGCGCCATCATGGGTGAGATAGGCACAACA 8 99 
M I I i I I I I I I I I II I I I I I I I I I I M i I I I I I i M 1 I M I I I MM I II I M I I M I 
ACCGGTGTCCTGACACCTTCCGCCATCAGATGCGCCATCATGGCTGAGCTAGGCACAACA 198 

ATGAAATCACACAGATAATTCAGGGAAAACGTTCTGGTCTTACGGGTGATGTAGGTTTTT 9 59 

I II I II I II i II ! I I M I I M II I II I M M I I I I M M M I I I II I I I I II M M I I II 
ATGAAATCACACAGATAATTCAGGGAAAACGTTCTGGTCTTACGGGTGATGTAGGTTTTT 192 

TGTCTGACAATAGTGAAGCGGTGACAGCATATCAGACGGCTCAGTCCTGCTATATTACTG 101 

II II I I I I I I I I I I M I I I I I I II I I I II II I I I I I II II I II I M I II II i I I II I I I 
TGTCTGACAATACTGAAGCGGTGACAGCATATCAGACGGCTCAGTCCTGCTATATTACTG 186 

TCATGGCCACT ATGGCAGATGACCAGATCAGGTTT AAATTCCCCGAT AATCCGTCGAAGT 107 
I I I I II I I I II I I II I I If M M I I I II II II II II I I II I i II M I I I II I II II I II I 
TCATGGCCACT ATGGCAGATGACCAGATCAGGTTT AAATTCCCCGAT AATCCGTCGAAGT 180 

CTGAGGATGGAAGGAAGGTGAAGGCTGTTCCTGAAAGGAATAAAAGTGACATCATGCCCT 113 
I I I II I I I I I I II I II II I I I I I I I I II II II I I I I I I I II II II II I I I M II II M II 
CTGAGGATGGAAGGAAGGTGAAGGCTGTTCCTGAAAGGAATAAAAGTGACATCATGCCCT 17 4 

CTTTTTCTGGCTTCCGGAGCAATTTTACTTTTTTCTCTGCAG 1181 
I I II I I I I I II II I II II I I II I I II II II I I I I I I II II I I 
CTTTTTCTGGCTT CCGGAGCAATTTTACTTTTTTCTCTGCAG 17 05 



>dbj |AB011549.2| El 

sequence 
Length=92721 



Escherichia coli 0157 :H7 str. Sakai plasmid p0157 DNA, com; 



Score = 2111 bits (2340), Expect = 0.0 
Identities = 1178/1182 (99%), Gaps = 1/1182 (0%) 
Strand=Plus /Minus 

CTGCAGGAGA-TGG :, a - a « .-• ;cc j a .a h ,■; a a * a TTGCCCATCCCAGCGCGCTCCAGCT b 9 

I I II I II I II I I I M II II II iM I II I I I II I II I I I II II M II II II II II M M I I 

CTGCAGGAGAGTGGAAAAAAAGCCAAAATAAAAAATTGCCCATCCCAGCGCGCTCCAGCT 88 

GAAAGTAGGCCTGTTCTGTCCGGTATTTAAATGCATTGACCGTCCCCGTATTTAAACAAT 11 

I I II I II I I I II II I I M II I II I I II I II II I I II I I II M I II II I I II II I II I M I 

GAAAGTAGGCCTGTTCTGTCCGGTATTTAAATGCATTGACCGTCCCCGTATTTAAACAAT 88 

GTGATAAATTACTCCGTTACCGGAAAACCGCTGAACAAAATTCGGGCTGAAAAGAGGATC 17 

I ! I II II I I It II M I II I I II II I M I I M II II II I M I I I II II II I II II I I II I I 

GTGATAAATTACTCCGTTACCGGAAAACCGCTGAACAAAATTCGGGCTGAAAAGAGGATC 8 8 

CGCCGTTATCTGTTGCATTTCCCCTTAGCCTGACTAGCCAGAGACACAATGATCTGTGCC 2 3 

I I II I I I I I I I I M I II I II I II II I II I I II II I M I II M I I II II I II II II II II I 
CGCCGTTATCTGTTGCATTTCCCCTTAGCCTGACTAGCCAGAGACACAATGATCTGTGCC 88 

GTTCTGTTAATATCAAACCGGTACTCAATATCTTCTCTGGCGCTGGCTGCCATCATCCGG 2 9 

II I I I II M II II I II I II II II I I I II I I II I II II I II I I I I I I I II I II II I II I II 
GTTCTGTTAATATCAAACCGGTACTCAATATCTTCTCTGGCGCTGGCTGCCATCATCCGG 88 

AAGCGTTCCGGTCGGGATAAAAAATCGCGCAGTGCGCCGGTCCATGCAGACACATCCCCC 35 

II I I II II II I I II II II II I I I I I I I II I I I II II I II I II II II II I I I M I I II II I 

AAGCGTTCCGGTCGGGATAAAAAATCGCGCAGTGCGCCGGTCCATGCAGACACATCCCCC 8 8 

ACGGGTAACAGCGTCCCTGTCACATTCTTCTGAATGACATCAGGGATCCCGCCCGTCTCA 41 

II II I II II I I M I II II II I II I I II I II I 1 I M I I I II I II I I I I I I II M I I II I II 

ACGGGTAACAGCGTCCCTGTCACATTCTTCTGAATGACATCAGGGATCCCGCCCGTCTCA 88 

CTGGCGATAACGGGCACGCCGGAGACTGACGCTTCAGCCAGTACCATACCAAACGCTTCA 4 7 

I II II I II I II II I II II II I I I I I II II I I II I I I I I II I II I I M M I I II I M I I II 
CTGGCGATAACGGGCACGCCGGAGACTGACGCTTCAGCCAGTACCATACCAAACGCTTCA 88 

TTTTCCGAAGGCATGACCACCACACTGGCAATCCGGTAGACCGGTAACGCTGGGAAAAGG 53 

II I I ! II M M II I I II II I I I II I M I I I ! I II I M II M II I I II II II II I I I I I II 
TTTTCCGAAGGCATGACCACCACACTGGCAATCCGGTAGACCGGTAACGCTGGGAAAAGG 88 

GCACCTGCCATTAACACATCTCCGCTCATTCCCAGGTGTTCTGTCTGCTGACGCAGACGT 59 
I M M I II II II I II II I II II II I II II I I II I I I I I II I I I I I I II I I II I I I I I I M 

GCACCTGCCATTAACACATCTCCGCTCATTCCCAGGTGTTCTGTCTGCTGACGCAGACGT 88 
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GCTTCGTATTCTTCACGCCCGGCGCCCACCACGAGCCAGCGAAATGATTTCCCTTCCATC 65 
I I I I I i i I ! I I I I I I I I I 1 I I I I M I I I I I I I I I I I I I I I I M I I I I I I I I I I I I i I i I I 
GCTTCGTATTCTTCACGCCCGGCGCCCACCACGAGCCAGCGAAATGATTTCCCTTCCATC 88 

TTCAGCTGATACAATACACGCAGCATAAATTCATGTCCTTTTTCGGGACGTAGCATCCCC 71 

I I I I I I I I I I I I I I I II I I I I I I I I I I f I I I I M 1 I I I I I I I I I I I I I I I I I I I i I I I I I 
TTCAGCTGATACAATACACGCAGCATAAATTCATGTCCTTTTTCGGGACGTAGCATCCCC 88 

ACCTGAACGATAAGCGGAACATTGTCTGCTGATGCAGCCCAGGCGTGGATATGCAGGGGT 7 7 

I II I I I I I I I I I II I I I M M I I I I II I i I I I I I I I I I M I I I I I i ! I I I I I I I I I I I II 
ACCTGAACGATAAGCGGAACATTGTCTGCTGATGCAGCCCAGGCGTGGATATGCAGGGGT 88 

AACGGTCGCATGGCTTCATTATGCAATGCGGGCCAGTCGAAACCCGGTGGAATAACCGTT 83 
I I M I I I I I I I I I I I M i I I I I I I I I I I I I M I I I I I I I I 1 I I I I I I M I I II M I I I I I 
AACGGTCGCATGGCTTCATTATGCAATGCGGGCCAGTCGAAACCCGGTGGAATAACCGTT 88 

ACCGGTGTCCTGACACCTTCCGCCATCAGATGCGCCATCATGGGTGAGATAGGCACAACA 89 

I i I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I E If 
ACCGGTGTCCTGACACCTTCCGCCATCAGATGCGCCATCATGGCTGAGCTAGGCACAACA 88 

ATGAAATCACACAGATAATTCAGGGAAAACGTTCTGGTCTTACGGGTGATGTAGGTTTTT 95 

I I I I I I I ! I I I I I I I I I II I I I I I I I I I I I II I I I II I I I II I II I I I I i I I I I I I M I I 
ATGAAATCACACAGATAATTCAGGGAAAACGTTCTGGTCTTACGGGTGATGTAGGTTTTT 87 

TGTCTGACAATAGTGAAGCGGTGACAGCATATCAGACGGCTCAGTCCTGCTATATTACTG 10 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I i I I 
TGTCTGACAATACTGAAGCGGTGACAGCATATCAGACGGCTCAGTCCTGCTATATTACTG 87 

TCATGGCCACTATGGCAGATGACCAGATCAGGTTTAAATTCCCCGATAATCCGTCGAAGT 10 

II I I I I I I I I I I I I I I I M I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TCATGGCCACTATGGCAGATGACCAGATCAGGTTTAAATTCCCCGATAATCCGTCGAAGT 87 

CTGAGGATGGAAGGAAGGTGAAGGCTGTTCCTGAAAGGAATAAAAGTGACATCATGCCCT 11 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
CTGAGGATGGAAGGAAGGTGAAGGCTGTTCCTGAAAGGAATAAAAGTGACATCATGCCCT 87 

CTTTTTCTGGCTTCCGGAGCAATTTTACTTTTTTCTCTGCAG 1181 

I II f I I I I I I I I I I I 1 I I I I I I I I II I I I I I I I I I I I I I I I I 
CTTTTTCTGGCTTCCGGAGCAATTTTACTTTTTTCTCTGCAG 87 769 



>gb| AF043470 . 1 1 AF043470 B Escherichia coli plasmid p0157 ecf4 gene, partial c< 

ecf2, and ecfl genes, complete cds 

Length=5612 

GENE ID: 5290933 SF0157 jpl6 I w0016 [Escherichia coli) 
(10 or fewer PubMed links) 

Score = 2102 bits (2330), Expect = 0.0 
Identities = 1176/1182 (99%) , Gaps = 1/1182 (0%) 
Strand=Plus/Plus 

CTGCAGGAGA-TGG : : ; ■•: -i f i: <•;,.;,; ,, .;c;TTGCCCATCCCAGCGCGCTCCAGCT 59 

I I I I I I I I I I I I I I i I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CTGCAGGAGAGTGGAAAAAAAGCCAAAAT AAA AAATTGCCCATCCCAGCGCGCTCCAGCT 2 4 6 

GAAAGT AGGCCTGTTCTGTCCGGTATTT AA ATGC ATTGACCGTCCCCGT ATTTAAACAAT 119 
I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GAAAGT AGGCCTGTTCTGTCCGGTATTT AAATGCATTGACCGTCCCCGT ATTTAAACAAT 2 52 

GTGATAAATTACTCCGTTACCGGAAAACCGCTGAACAAAATTCGGGCTGAAAAGAGGATC 17 9 

I I I I I I I I I I II I I I I I I I I I II I I M I M I I I I I I I I I I I I I I M I II I I I I I I I M I I 
GTGATAAATTACTCCGTTACCGGAAAACCGCTGAACAAAATTCGGGCTGAAAAGAGGATC 2 58 

CGCCGTTATCTGTTGCATTTCCCCTTAGCCTGACTAGCCAGAGACACAATGATCTGTGCC 2 39 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I 
CGCCGTTATCTGTTGCATTTCCCCTTAGCCTGACTAGCCAGAGACACAATGATCTGTGCC 2 64 

GTTCTGTTAATATCAAACCGGTACTCAATATCTTCTCTGGCGCTGGCTGCCATCATCCGG 299 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
GTTCTGTTAATATCAAACCGGTACTCAATATCTTCTCTGGCGCTGGCTGCCATCATCCGG 270 

AAGCGTTCCGGTCGGGATAAAAAATCGCGCAGTGCGCCGGTCCATGCAGACACATCCCCC 359 

I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 
AAGCGTTCCGGTCGGGATAAAAAATCGCGCAGTGCGCCGGTCCATGCAGACACATCCCCC 27 6 

ACGGGTAACAGCGTCCCTGTCACATTCTTCTGAATGACATCAGGGATCCCGCCCGTCTCA 419 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
ACGGGTAACAGCGTCCCTGTCACATTCTTCTGAATGACATCAGGGATCCCGCCCGTCTCA 282 
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CTGGCGAT A ACGGGCACGCCGGAGACTGACGCTTCAGCCAGTACCATACCAAACGCTTCA 4 7 9 

I I I I I I I I I I I I li I I I I I I I I I I I I I I I II I I I II I I I I I I i I I I I I I I fl ! I I I I I I I 
CTGGCGAT AACGGGCACGCCGGAGACTGACGCTTCAGCCAGTACCATACCAAACGCTTCA 288 

TTTTCCGAAGGCATGACCACCACACTGGCAATCCGGTAGACCGGTAACGCTGGGAAAAGG 539 

I I II I I I i I I I I I I I I I I I I 11 I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I i 
TTTTCCGAAGGCATGACCACCACACTGGCAATCCGGTAGACCGGTAACGCTGGG^AAAGQ 294 

GCACCTGCCATTAACACATCTCCGCTCATTCCCAGGTGTTCTGTCTGCTGACGCAGACGT 599 
I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I 1 I I I I I I I I I I M ! 
GCACCTGCCATTAACACATCTCCGCTCATTCCCAGGTGTTCTGTCTGCTGACGCAGACTT 300 

GCTTCGTATTCTTCACGCCCGGCGCCCACCACGAGCCAGCGAAATGATTTCCCTTCCATC 659 

I i I I I I I I I I I I I I II I I I I I I I I I I I I ! I I I I I II I I I I I I I I I I I M I I I I I I II I I I 
GCTTCGTATTCTTCACGCCCGGCGCCCACCACGAGCCAGCGAAATGATTTCCCTTCCATC 30 6 

TTCAGCTGATACAATACACGCAGCAT AAATTC ATGTCCTTTTTCGGGACGTAGCATCCCC 719 
I I I I II I I I M I I I M I I I I I I II I I I I II I I I I M I I I I M I I I I I I I I I I I I II I I I I 
TTCAGCTGATACAATACACGCAGCATAAATTCATGTCCTTTTTCGGGACGTAGCATCCCC 312 

ACCTGAACGATAAGCGGAACATTGTCTGCTGATGC AGCCCAGGCGTGGATATGCAGGGGT 7 7 9 
I I I I I I M I I I I I I I I I M i I I I I I I I I I I I I I I I I I I I II M I I I I I I I I I I I I I I I I I 
ACCTGAACGATAAGCGGAACATTGTCTGCTGATGCAGCCCAGGCGTGGATATGCAGGGGT 318 

AACGGTCGC ATGGCTTCATTATGC AATGCGGGCCAGTCGAAACCCGGTGGAATAACCGTT 839 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AACGGTCGCATGGCTTCATTATGCAATGCGGGCCAGTCGAAACCCGGTGGAATAACCGTT 32 4 

ACCGGTGTCCTGACACCTTCCGCCATCAGATGCGCCATCATGGGTGAGATAGGCACAACA 899 

I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ACCGGTGTCCTGACACCTTCCGCCATCAGATGCGCCATCATGGCTGAGCTAGGCACAACA 330 

ATGAAATCACACAGATAATTCAGGGAAAACGTTCTGGTCTTACGGGTGATGTAGGTTTTT 95 9 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ATGAAATCACACAGAT AATTCAGGGAAAACGTTCTGGTCTT ACGGGTGATGT AGGTTTTT 336 

TGTCTGACAATAGTGAAGCGGTGACAGCATATCAGACGGCTCAGTCCTGCTATATTACTG 101 
I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I M I I I I I 
TGTCTGACAATACTGAAGCGGTGACAGCATATCAGACGGCTCAGTCCTGCTATATTACTG 34 2 

TCATGGCCACTATGGCAGATGACCAGATCAGGTTTAAATTCCCCGATAATCCGTCGAAGT 107 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TCATGGCCACTATGGCAGATGACCAGATCAGGTTTAAATTCCCCGATAATCCGTCGAAAT 34 8 

CTGAGGATGGAAGGAAGGTGAAGGCTGTTCCTGAAAGGAATAAAAGTGACATCATGCCCT 113 
I I II I 1 I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CTGAGGATGGAAGGAAGGTGAAGGCTGTTCCTGAAAGGAATAAAAGTGACATCATGCCCT 354 

CTTTTTCTGGCTTCCGGAGCAATTTTACTTTTTTCTCTGCAG 1181 

I I I II I I I I I I I I I I I I M I II I I I I II I I I I I I I I I I I I M 
CTTTTTCTGGCTTCCGGAGCAATTTTACTTTTTTCTCTGCAG 358 5 



>gb|CP000035.1| E9 Shigella dysenteriae Sdl97 plasrnid pSDl_197, complete seque 
Length=182726 

Score = 138 bits (152), Expect = 6e-29 
Identities = 207/289 (71%), Gaps = 4/289 (1%) 
Strand-Plus /Minus 

Query 237 GCCGTTCTGTTAATATCAAACCGGTACTCAATATCTTCTCTGGCGCTGGCT-GCCATCAT 29 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 87 991 GCAGTTTTATTAATATCAAACCGCTCTTCTATATCCTGTTTTGC-CTGGCGAGCCATCTG 87 

Query 296 CCGGAAGCGTTCCGG-TCGGGATAAAAAATCGCGCAGTGCGCCGGTCCATGCAGACACAT 35 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II 
Sbjct 87932 ATAAAAACGTCCAGGGTCATTA-AAAA/\ATCATTCAGGGCGCACATCCATGCGTGCTTAT 87 

Query 355 CCCCCACGGGTAACAGCGTCCCTGTCACATTCTTCTGAATGACATCAGGGATCCCGCCCG 4 1 

II I I i I I I I I I I I I I III II I I I I I I I I I I I I I I I I j I I II II 
Sbjct 87873 TACCTGCTGGTAACAATGTCCCGGTCTGGTTGTTCTGAAT AACATCAGGGATTCCACCAA 8 7 

Query 4 1 5 TCTCACTGGCGATAACGGGCACGCCGGAGACTGACGCTTCAGCCAGTACCATACCAAACG 4 7 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Sbjct 87 813 TCTGACTGGCCAGTACAGGCACAGAAAATGCCGATGCTTCTGCCAGTACC AT ACCAAACG 8 7 

Query 4 75 CTTCATTTTCCGAAGGCATGACCACCACACTGGCAATCCGGTAGACCGG 523 

III I I I I I I I I I I I I I II I I I I II I I I I I I Ml II II II 
Sb j ct 87 753 ATTCGTTTTCTGAAGGCAGAACCACCAGACTGGCAACCCGATATACGGG 8 77 0 5 



httn*//h1flct nrhi nlm nih orw/Rlact r>rri 



1 1 /ninnnQ 



NCBI BIast:BD205212:Nucleotide sequence for detecting.. 
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Query 


237 


Sbjct 


115503 


Query 


296 


Sbjct 


115562 


Query 


355 


Sbjct 


115621 


Query 


415 


Sbjct 


115681 


Query 


475 


Sbjct 


115741 



>gb|CP001064.1| 13 Escherichia coli 53638 plasmid p53638_226, complete sequenc 
Length-225683 

Features in this part of subject sequence: 
UDP- sugar hydrolase 

Score = 132 bits (146), Expect = 3e-27 
Identities - 206/289 (71%), Gaps - 4/289 (1%) 
Strand=Plus/Plus 

GCCGTTCTGTTAATATCAAACCGGTACTCAATATCTTCTCTGGCGCTGGCT-GCCATCAT 2 

I I I I i I I I i 1 I I I I M I I I I I I I I I I M I I I I I I I I I 1 I I I I M 
GCAGTTTTATTAATATCAAACCGCTCTTCTATATCCTGTTTTGC-CTGGCGAGCCATCTG 1 

CCGGAAGCGTTCCGG-TCGGGATAAAAAATCGCGCAGTGCGCCGGTCCATGCAGACACAT 3 

111! I I I I I I I i I I I I I I I I I I ! I I I I II I I I I M i 
ATAAAAACGCCCAGGGTCATTA-AAAAAATCATTCAGGGCGCACATCCATGCGTGCTTAT 1 

CCCCCACGGGTAACAGCGTCCCTGTCACATTCTTCTGAATGACATCAGGGATCCCGCCCG 4 

II I I I I I I M I I I I I I I I I I I I I II I i I I i I M I I I I I I I I M 
TACCTGCTGGTAACAATGTCCCGGTCTGGTTGTTCTGAATAACATCAGGGATTCCACCAA 1 

TCTCACTGGCGATAACGGGCACGCCGGAGACTGACGCTTCAGCCAGTACCATACCAAACG 4 

I II I I I I I I I I I ! II I I I I I I I I I I I I I M I I I I I I I I I I I I I I I ^ 

TCTGACTGGCCAGTACAGGCACAGAAAATGCCGATGCTTCTGCCAGTACCATACCAAACG 1 

CTTCATTTTCCGAAGGCATGACCACCACACTGGCAATCCGGTAGACCGG 52 3 

Ml EMM I I M I I I I I M I I I I II I I I I I III II II II 
ATTCGTTTTCTGAAGGCAGAACCACCAGACTGGCAACCCGATATACGGG 115789 

>gb|CP001062.1| 03 Shigella boydii CDC 3083-94 plasmid pBS512_211, complete se 
Length-210919 

Features in this part of subject sequence: 

putative glycosyl transferase, group 1 family protein 

Score = 132 bits (146), Expect = 3e-27 
Identities = 206/289 (71%), Gaps - 4/289 (1%) 
Strand=Plus /Minus 

GCCGTTCTGTTAATATCAAACCGGTACTCAATATCTTCTCTGGCGCTGGCT-GCCATCAT 2 

I I I I I I M I I I I I I M I I M I M M I I I I I I M M I II MUM 

GCAGTTTTATTAATATCAAACCGCTCTTCTATATCCTGTTTTGC-CTGGCGAGCCATCTG 1 
CCGGAAGCGTTCCGG-TCGGGATAAAAAATCGCGCAGTGCGCCGGTCCATGCAGACACAT 3 

II I I Mil! I I M I I I I I I M I I I I M I I M I I II 

ATAAAAACGCCCAGGGTCATTA-AAAAAATCATTCAGGGCGCACATCCATGCGTGCTTAT 1 

CCCCCACGGGTAACAGCGTCCCTGTCACATTCTTCTGAATGACATCAGGGATCCCGCCCG 4 

I | I II I I I I I I II I I III II I I I II I I I I I I II I I I M I II II 
TACCTGCTGGTAACAATGTCCCGGTCTGGTTGTTCTGAATAACATCAGGGATTCCACCAA 1 

TCTCACTGGCGATAACGGGCACGCCGGAGACTGACGCTTCAGCCAGTACCATACCAAACG 4 

|| I II I I I I I I I I I I I I I I II I I I I ! I I i I I I I M II I I I I I I I I 

TCTGACTGGCCAGTACAGGCACAGAAAATGCCGATGCTTCTGCCAGTACCATACCAAACG 1 

CTTCATTTTCCGAAGGCATGACCACCACACTGGCAATCCGGTAGACCGG 52 3 

Mi I I I II II I II M M I II I I I II I M II Ml II II II 
ATTCGTTTTCTGAAGGCAGAACCACCAGACTGGCAACCCGATATACGGG 17 5968 

>gb I AF386526 . 1 | 83 Shigella flexneri 2a str. 301 virulence plasmid pCP301, com; 

sequence 

Length==221618 

Features in this part of subject sequence: 
UDP- sugar hydrolase 

Score = 132 bits (146), Expect - 3e-27 
Identities = 206/289 (71%), Gaps = 4/289 (1%) 
Str and=Plus /Minus 

Query 237 GCCGTTCTGTTAATATCAAACCGGTACTCAATATCTTCTCTGGCGCTGGCT-GCCATCAT 2 

M I I I I I I M II I II I I M I I I I M I I I I I I I I M I I I I II I II 
Sbjct 193608 GCAGTTTTATTAATATCAAACCGCTCTTCTATATCCTGTTTTGC-CTGGCGAGCCATCTG 1 

Query 296 CCGGAAGCGTTCCGG-TCGGGATAAAAAATCGCGCAGTGCGCCGGTCCATGCAGACACAT 3 



Query 


237 


Sbjct 


176254 


Query 


296 


Sbjct 


176195 


Query 


355 


Sbjct 


176136 


Query 


415 


Sbjct 


176076 


Query 


475 


Sbjct 


176016 



httn7/hln«t nrVn nlm niVt anv/Rln«3t r.oi 



71 /rnnnm 



NCBI Blast:BD205212:Nucleotide sequence for detecting.. 



Page 1 1 sur 1 7 



Sbj ct 


1 Q T t: a n 

i y o oh y 


Query 


jo j 


oD] Ct 




Query 


415 


Sbjct 


193430 


Query 


475 


Sbjct 


193370 



I I I I I I I I I I I i I I I ! I I I I I I I I I I I I 1 I I I I El 

ATAAAAACGCCCAGGGTCATTA-AAAAAATCATTCAGGGCGCACATCCATGCGTGCTTAT 1 

CCCCCACGGGTAACAGCGTCCCTGTCACATTCTTCTGAATGACATCAGGGATCCCGCCCG 4 

II I I I I I I M I II I I I I I I I I I I I I I I 1 I I I I I I I i I I I I I I 1 
TACCTGCTGGTAACAATGTCCCGGTCTGGTTATTCTGAATAACATCAGGGATTCCACCAA 1 

TCTCACTGGCGATAACGGGCACGCCGGAGACTGACGCTTCAGCCAGTACCATACCAMCG 4 

I I I I I I I I I I I I M I I I I I I I I I I I I I I 1 I I I I I I I I I I M I I M 
TCTGACTGGCCAGTACAGGCACAGAAAATGCCGATGCTTCTGCCAGTACCATACCAAACG 1 

CTTCATTTTCCGAAGGCATGACCACCACACTGGCAATCCGGTAGACCGG 52 3 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I III M M M 
ATTCGTTTTCTGAAGGCAGAACCACCAGACTGGCAACCCGAT ATACGGG 193322 

>gb|AY206446.1| Shigella flexneri plasmid pINV_F6_Ml 382 ORF186 (ORF186) , VirK 

(virK), and MsbB2 (msbB2) genes, complete cds 

Length-3815 

Score - 132 bits (146), Expect - 3e-27 
Identities = 206/289 (71%), Gaps = 4/289 (1%) 
St rand=Plus /Minus 

GCCGTTCTGTTAATATCAAACCGGTACTCAATATCTTCTCTGGCGCTGGCT-GCCATCAT 295 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I ! I I I I I I I I MINI 
GCAGTTTTATTAATATCAAACCGCTCTTCTATATCCTGTTTTGC-CTGGCGAGCCATCTG 17 5 

CCGGAAGCGTTCCGG-TCGGGATAAAAAATCGCGCAGTGCGCCGGTCCATGCAGACACAT 354 

I I I I I I I I I I I I I I I I I I II I I I I I I I I M I I I II 
ATAAAAACGCCCAGGGTCATTA-AAAAAATCATTCAGGGCGCACATCCATGCGTGCTTAT 17 0 

CCCCCACGGGTAACAGCGTCCCTGTCACATTCTTCTGAATGACATCAGGGATCCCGCCCG 414 

II I I I I I I I I I I I I I I II II I I II I I I I I I I I I I I I I I I I I I I 
TACCTGCTGGT AACAATGTCCCGGTCTGGTTGTTCTGAATAACATCAGGGATTCCACCAA 164 

TCTCACTGGCGATAACGGGCACGCCGGAGACTGACGCTTCAGCCAGTACCATACCAAACG 4 7 4 

III MINI I I I I I I I I I I I I I I I II I I i I I I I I I I I ! I I i I I I I 

TCTGACTGGCCAGTACAGGCACAGAAAATGCCGATGCTTCTGCCAGTACCATACCAAACG 158 

CTTCATTTTCCGAAGGCATGACCACCACACTGGCAATCCGGTAGACCGG 523 

III II I I I I I I I I I I I I I I I I I I I I I I I I I III II II II 
ATTCGTTTTCTGAAGGCAGAACCACCAGACTGGCAACCCGAT ATACGGG 1531 

>emb|AL391753.1 ISFPWR100 13 Shigella flexneri virulence plasmid pWRlOO: from 1 
Length=213494 

Features in this part of subject sequence: 
unnamed protein product 

Score - 132 bits (146), Expect = 3e-27 
Identities = 206/289 (71%), Gaps = 4/289 (1%) 
Strand^Pl us /Minus 

GCCGTTCTGTTAATATCAAACCGGTACTCAATATCTTCTCTGGCGCTGGCT-GCCATCAT 2 

I I I I I I I I I I I I I I i I II I I I I I I I I I I I I I I I I II II I I I I I I 
GCAGTTTTATTAATATCAAACCGCTCTTCTATATCCTGTTTTGC-CTGGCGAGCCATCTG 1 

CCGGAAGCGTTCCGG-TCGGGATAAAAAATCGCGCAGTGCGCCGGTCCATGCAGACACAT 3 

I I I I I I I I 1 I I I I I I I I I I I I I I I I I II I I I I I II 
ATAAAAACGCCCAGGGTCATTA-AAAAAATCATTCAGGGCGCACATCCATGCGTGCTTAT 1 

CCCCCACGGGTAACAGCGTCCCTGTCACATTCTTCTGAATGACATCAGGGATCCCGCCCG 4 

II I I I I I I I I I II ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TACCTGCTGGTAACAATGTCCCGGTCTGGTTATTCTGAATAACATCAGGGATTCCACCAA 1 

TCTCACTGGCGATAACGGGCACGCCGGAGACTGACGCTTCAGCCAGTACCATACCAAACG 4 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 

TCTGACTGGCCAGTACAGGCACAGAAAATGCCGATGCTTCTGCCAGTACCATACCAAACG 1 

CTTCATTTTCCGAAGGCATGACCACCACACTGGCAATCCGGTAGACCGG 523 

II I I I I I I I I I I I I I I I I I I I I I II I I I I I III I I II II 
ATTCGTTTTCTGAAGGCAGAACCACCAGACTGGC AACCCGAT ATACGGG 18 6982 



Query 


237 


Sbjct 


1817 


Query 


296 


Sbjct 


1758 


Query 


355 


Sbjct 


1699 


Query 


415 


Sbjct 


1639 


Query 


475 


Sbjct 


1579 



Query 


237 


Sbjct 


187268 


Query 


296 


Sbjct 


187209 


Query 


355 


Sbjct 


187150 


Query 


415 


Sbjct 


187090 


Query 


475 


Sbjct 


187030 



>gb| AF348706. 1 1 13 Shigella flexneri 5a plasmid virulence plasmid pWRSOl, c 



omp 



sequence 
Length-221851 



NCBI Blast:BD205212:Nucleotide sequence for detecting.. 
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Query 




Sb j ct 


184956 


Query 


£ i7 0 


Sbjct 


184897 


Query 


355 


Sbjct 


184838 


Query 


415 


Sbjct 


1.84778 


Query 


475 


Sbjct 


184718 



Features in this part of subject sequence: 
UDP-sugar hydrolase 

Score « 132 bits (146), Expect = 3e-27 
Identities = 206/289 (71%), Gaps - 4/289 (1%) 
Strand=Plus /Minus 

GCCGTTCTGTTAATATCAAACCGGTACTCAATATCTTCTCTGGCGCTGGCT-GCCATCAT 2 

II I I I I I I I I I 1 M I I 1 I I I I I I I I I I I I 1 I II I I I I I MINI 
GCAGTTTTATTAATATCAAACCGCTCTTCTATATCCTGTTTTGC-CTGGCGAGCCATCTG 1 

CCGGAAGCGTTCCGG-TCGGGATAAAAAATCGCGCAGTGCGCCGGTCCATGCAGACACAT 3 

M II I I I I I I ! I I I I I I I I f I I I I I M I I I I I I I I 
ATAAAAACGCCCAGGGTCATTA-AAAAAATCATTCAGGGCGCACATCCATGCGTGCTTAT 1 

CCCCCACGGGTAACAGCGTCCCTGTCACATTCTTCTGAATGACATCAGGGATCCCGCCCG 4 

II I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I 1 I I ! I I I I I I 
TACCTGCTGGTAACAATGTCCCGGTCTGGTTATTCTGAATAACATCAGGGATTCCACCAA 1 

TCTCACTGGCGATAACGGGCACGCCGGAGACTGACGCTTCAGCCAGTACCATACCAAACG 4 
MINIMI! I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I M I 

TCTGACTGGCCAGTACAGGCACAGAAAATGCCGATGCTTCTGCCAGTACCATACCAAACG 1 

CTTCATTTTCCGAAGGCATGACCACCACACTGGCAATCCGGTAGACCGG 52 3 

III I I I M I I I II I I I I I I I I I I I I I I I I I III I I II M 
ATTCGTTTTCTGAAGGCAGAACCACC AGACTGGCAACCCGAT AT ACGGG 184 670 

>gb|AY879342 . II 13 Shigella flexneri plasmid pSF5, complete sequence 
Length=136694 

Score = 132 bits (146), Expect = 3e-27 
Identities - 206/289 (71%), Gaps = 4/289 (1%) 
Strand=Plus/Plus 

GCCGTTCTGTTAATATCAAACCGGTACTCAATATCTTCTCTGGCGCTGGCT-GCCATCAT 29 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I M I I I I 
GCAGTTTTATTAATATCAAACCGCTCTTCTATATCCTGTTTTGC-CTGGCGAGCCATCTG 62 

CCGGAAGCGTTCCGG-TCGGGATAAAAAATCGCGCAGTGCGCCGGTCCATGCAGACACAT 35 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
ATAAAAACGCCCAGGGTCATTA-AAAAAATCATTCAGGGCGCACATCCATGCGTGCTTAT 62 

CCCCCACGGGTAACAGCGTCCCTGTCACATTCTTCTGAATGACATCAGGGATCCCGCCCG 41 

II I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I 
TACCTGCTGGTAACAATGTCCCGGTCTGGTTATTCTGAATAACATCAGGGATTCCACCAA 62 

TCTCACTGGCGATAACGGGCACGCCGGAGACTGACGCTTCAGCCAGTACCATACCAAACG 47 
I | I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

TCTGACTGGCCAGTACAGGCACAGAAAATGCCGATGCTTCTGCCAGTACCATACCAAACG 62 

CTTCATTTTCCGAAGGCATGACCACCACACTGGCAATCCGGTAGACCGG 52 3 

III I I I I I I I I I I I I I I I I I I I I I I I I I M III II II II 
ATTCGTTTTCTGAAGGCAGAACCACCAGACTGGCAACCCGAT AT ACGGG 62 502 

>gb|CP000037.1| E3 Shigella boydii Sb227 plasmid pSB4_227, complete sequence 
Length=126697 

Score = 132 bits (146) , Expect - 3e-27 
Identities - 206/289 (71%), Gaps = 4/289 (1%) 
Strand=Plus /Minus 

GCCGTTCTGTTAATATCAAACCGGTACTCAATATCTTCTCTGGCGCTGGCT-GCCATCAT 29 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I Mill 
GCAGTTTTATTAATATCAAACCGCTCTTCTATATCCTGTTTTGC-CTGGCGAGCCATCTG 98 

CCGGAAGCGTTCCGG-TCGGGATAAAAAATCGCGCAGTGCGCCGGTCCATGCAGACACAT 35 

I I I I I I I I I I I I I j I I I I I I I I I I I I I I I I I I I II 
ATAAAAACGCCCAGGGTCATTA-AAAAAATCATTCAGGGCGCACATCCATGCGTGCTTAT 98 

CCCCCACGGGTAACAGCGTCCCTGTCACATTCTTCTGAATGACATCAGGGATCCCGCCCG 41 

II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II I I 
TACCTGCTGGTAACAATGTCCCGGTCTGGTTGTTCTGAATAACATCAGGGATTCCACCAA 97 

TCTCACTGGCGATAACGGGCACGCCGGAGACTGACGCTTCAGCCAGTACCATACCAAACG 4 7 
I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I II I I I I 

TCTGACTGGCCAGTACAGGCACAGAAAATGCCGATGCTTCTGCCAGTACCATACCAAACG 97 

CTTCATTTTCCGAAGGCATGACCACCACACTGGCAATCCGGTAGACCGG 523 



Query 


237 


Sbjct 


62216 


Query 


296 


Sbjct 


62275 


Query 


355 


Sbjct 


62334 


Query 


415 


Sbjct 


62394 


Query 


475 


Sbjct 


62454 



Query 


237 


Sbjct 


98146 


Query 


296 


Sbjct 


98087 


Query 


355 


Sbjct 


98028 


Query 


415 


Sbjct 


97968 


Query 


475 



hftiW/hlfiRt nrhi nlm nih an v/R last co\ 



91 /07/7nOS 



NCBI Blast:BD2052 12 Nucleotide sequence for detecting.. 
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! I I I I t I I I I I I I I I I I I I I I I I I I i I I I I ! I I I I I I I I 
Sb j ct 97 908 ATTCGTTTTCTGAAGGCAGAACCACCAGACTGGCAACCCGATATACGGG 9 7 8 $0 

>gb|CP000039.1| 13 Shigella sonnei Ss046 plasmid pSS_0 4 6, complete sequence 
I,ength=214396 

Features in this part of subject sequence: 
conserved hypothetical protein 

Score = 132 bits (146), Expect = 3e-27 
Identities - 206/289 (11%), Gaps - 4/289 (1%) 
St rand=Plus /Minus 

GCCGTTCTGTTAATATCAAACCGGTACTCAATATCTTCTCTGGCGCTGGCT-GCCATCAT 2 

I I III I I I I I I i I I I t I I I I I I I Mill I I I II Mill III III , 
GCAGTTTTATTAATATCAAACCGCTCTTCTATATCCTGTTTTGC-CTGGCGAGCCATCTG 1 

CCGGAAGCGTTCCGG-TCGGGATAAAAAATCGCGCAGTGCGCCGGTCCATGCAGACACAT 3 

I I I I I I I II I I I I I I M I M I I I I I I I I I M I i II 
AT AAAAACGC CC AGGGTC ATT A- AAAAAAT CAT TCAGGGCGC AC ATCC AT GCGTGCT TAT 1 

CCCCCACGGGTAACAGCGTCCCTGTCACATTCTTCTGAATGACATCAGGGATCCCGCCCG 4 

II I I M I I II I I M I I I I I I I I I I I I i I M M I I I M I I I I M 
TACCTGCTGGTAACAATGTCCCGGTCTGGTTGTTCTGAATAACATCAGGGATTCCACCAA 1 

TCTCACTGGCGATAACGGGCACGCCGGAGACTGACGCTTCAGCCAGTACCATACCAAACG 4 

III Mill I I II Mill I I M I M II MUM I I I I I I I 

TCTGACTGGCCAGTACAGGCACAGAAAATGCCGATGCTTCTGCCAGTACCATACCAAACG 1 

CTTCATTTTCCGAAGGCATGACCACCACACTGGCAATCCGGTAGACCG^ 52 3 

III II I I I I I II I I I I I I I I II I II I I I M II I II II II 
ATTCGTTTTCTGAAGGCAGAACCACCAGACTGGCAACCCGAT AT ACGGG 1582 8 4 

>dbj | D11025.1I SHFVIRK Shigella flexneri plasmid pMYSH6000 virK gene for virul 

complete cds 

Length=1642 

Score = 132 bits (146) , Expect = 3e-27 
Identities - 206/289 (71%), Gaps = 4/289 (1%) 
St rand=Plus /Minus 

GCCGTTCTGTTAATATCAAACCGGTACTCAATATCTTCTCTGGCGCTGGCT-GCCATCAT 295 
I I II I I I M II I I II M I I I I I I I II II I I I 1 I I II M I I II M 
GCAGTTTT ATT AATATC AAACCGCTCTTCTATATCCTGTTTTGC -CTGGCGAGCCATCTG 27 6 

CCGGAAGCGTTCCGG-TCGGGATAAAAAATCGCGCAGTGCGCCGGTCCATGCAGACACAT 354 

I I II I I I II I I M I II M I II I M I I I I I I I I I It 
ATAAAAACGCCCAGGGTCATTA-AAAAAATCATTCAGGGCGCACATCCATGCGTGCTTAT 217 



Query 


237 


Sbjct 


158570 


Query 


296 


Sbjct 


158511 


Query 


355 


Sbjct 


158452 


Query 


415 


Sbjct 


158392 


Query 


475 


Sbjct 


158332 



Query 


237 


Sbjct 


334 


Query 


296 


Sbjct 


275 


Query 


355 


Sbjct 


216 


Query 


415 


Sbjct 


156 


Query 


475 


Sbjct 


96 



II I I II II 1 1 1 1 II I I II 1 1 II 1 1 II II 1 1 1 II II II 1 1 



414 
157 



TCTCACTGGCGATAACGGGCACGCCGGAGACTGACGCTTCAGCCAGT ACCATACCAAACG 4 7 4 

I II I I II I I I II II II I I I I I II I I I I II I I II I I I I I I I i II I I 

TCTGACTGGCCAGTACAGGCACAGAAAATGCCGATGCTTCTGCCAGT ACCATACCAAACG 97 

CTTCATTTTCCGAAGGCATGACCACCACACTGGCAATCCGGTAGACCGG 523 

III Mill I II II I I I I II II I I I I I II I I III M II M 
ATTCGTTTTCTGAAGGCAGAACCACCAGACTGGCAACCCGAT AT ACGGG 4 8 

>gb|CP000799.1 I 13 Escherichia coli E24377A plasmid pETEC_74, complete sequenc- 
Length=74224 

Score = 111 bits (122), Expect - 9e-21 
Identities - 148/206 (71%), Gaps - 0/206 (0%) 
Strand=Plus /Minus 

Query 318 AAAAAATCGCGCAGTGCGCCGGTCCATGCAGACACATCCCCCACGGGTAACAGCGTCCCT 37 

I I I I II II I I I II I I I I I I I M I II i I I I I I II I I I I I II 
Sbjct 6132 5 AAAAAATCATTCAGGGCGCACATCCATGCGTGCTTATTACCTGCTGGTAACAATGTCCCG 61 

Query 378 GTCACATTCTTCTGAATGACATCAGGGATCCCGCCCGTCTCACTGGCGATAACGGGCACG 4 3 

III I I II I II I I I I I I I I I I I I I II I I I I I I I II I I I I II I I I 
Sbjct 612 65 GTCTGGTTGTTCTGAATAACCTCAGGGATTCCACCAATCTGAGTGGCCACTACAGGCACA 61 

Query 4 38 CCGGAGACTGACGCTTCAGCCAGTACCATACCAAACGCTTCATTTTCCGAAGGCATGACC 4 9 



hfhW/hlast nrhi nlrn m*Vi unv/R lust r.o\ 



9 1 /rv7/?nns 



NCBI Blast:BD205212:NucIeottde sequence for detecting. 
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I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I III 
Sbjct 61205 GAMATGCCGATGCTTCTGCCAGCACC AT ACC AAAAGATTCGTTTTCTGAAGGCAGAACC 6 1 

Query 4 98 ACCACACTGGCAATCCGGTAGACCGG 52 3 

MM 1 I I I I II I III II I I M 
Sbjct 6114 5 ACCAGACTGGCAACCCGATATACGGG 61120 

>gb|CP000795 . 1 1 0 Escherichia coli E24377A plasmid pETEC__80, complete sequent 
Length=79237 

Score « 111 bits (122), Expect = 9e-21 
Identities - 148/206 (71%), Gaps = 0/206 (0%) 
Strand=Plus/Plus 

AAAAAATCGCGCAGTGCGCCGGTCCATGCAGACACATCCCCCACGGGTAACAGCGTCCCT 37 
I I I I I I I f I I I I I I I I I I M I I i II I I I I I I I I I I I I I I I 
AAAAAATCATTCAGGGCGCACATCCATGCGTGCTTATTACCTGCTGGTAACAATGTCCCG 37 

GTCACATTCTTCTGAATGACATCAGGGATCCCGCCCGTCTCACTGGCGATAACGGGCACG 43 
III I I I I I M I I I I I I I I I I I I I I I I I I I I i I I I I ! II MM 
GTCTGGTTGTTCTGAATAACCTCAGGGATTCCACCAATCTGAGTGGCCACTACAGGCACA 37 

CCGGAGACTGACGCTTCAGCCAGTACCATACCAAACGCTTCATTTTCCGAAGGCATGACC 4 9 

i I II Mill I I II I I I I I I I I I M I i Ml I II I I I I I I I I 1 Ml 
GAAAATGCCGATGCTTCTGCCAGCACCATACCAAAAGATTCGTTTTCTGAAGGCAGAACC 37 

ACCACACTGGCAATCCGGTAGACCGG 523 
I I I I I I I I I M I II I II II II 
ACCAGACTGGCAACCCGATATACGGG 37382 

>dbj IAB255435.il EI E scherichia coli plasmid p086Al DNA, complete sequence 
Length-120730 

Score = 111 bits (122), Expect » 9e-21 
Identities = 148/206 (71%), Gaps = 0/206 (0%) 
Strand=Plus/Plus 

Query 318 AAAAAATCGCGCAGTGCGCCGGTCCATGCAGACACATCCCCCACGGGTAACAGCGTCCCT 377 

I I I I II II I I I I M I I I I I II I I II M i I I I M M I I I I I 
Sbjct 2154 AAAAAATCATTC AGGGCGCACATCCATGCGTGCTTATTACCTGCTGGTAACAATGTCCCG 221 

Query 378 GTCACATTCTTCTGAATGACATCAGGGATCCCGCCCGTCTCACTGGCGATAACGGGCACG 4 37 

Ml M I I I M I M I I I I M I M I M M I II I I I I I I M I I I I I 
Sbjct 2214 GTCTGGTTGTTCTGAATAACCTCAGGGATTCCACCAATCTGAGTGGCCACTACAGGCACA 227 

Query 4 38 CCGGAGACTGACGCTTCAGCCAGTACCATACCAAACGCTTCATTTTCCGAAGGCATGACC 4 97 

I I II II I I I I I I II I I I I I I I M I I I III I I I I I I II I II I III 
Sb j ct 22 74 GAAAATGCCGATGCTTCTGCCAGCACCATACCAAAAGATTCGTTTTCTGAAGGCAGAACC 2 33 

Query 498 ACCACACTGGCAATCCGGTAGACCGG 523 

I II I I I I I M I I III II I I M 
Sbjct 2 3 34 ACCAGACTGGCAACCCGATATACGGG 2359 



Query 


318 


Sbjct 


37177 


Query 


378 


Sbjct 


37237 


Query 


438 


Sbjct 


37297 


Query 


498 


Sbjct 


37357 



>gb|AF134403,l|AF134403 Escherichia coli plasmid pAA2 Shf (shf ) , hexosyltrans 

(capU) , and VirK (virK) genes, complete cds 

Length-3500 

Score = 111 bits (122), Expect - 9e-21 
Identities = 148/206 (71%) , Gaps = 0/206 (0%) 
Strand=Plus /Minus 

AAAAAATCGCGCAGTGCGCCGGTCCATGCAGACACATCCCCCACGGGTAACAGCGTCCCT 377 

M II II I I I I I I I I I I II I I II I II II I I I I M M I II II 
AAAAAATCATTCAGGGCGCACATCCATGCGTGCTTATTACCTGCTGGTAACAATGTCCCG 227 

GTCACATTCTTCTGAATGACATCAGGGATCCCGCCCGTCTCACTGGCGATAACGGGCACG 4 37 

ill II I I II I I M I I M II I I I I I I I I I II I I I I I I II I I II I 
GTCTGGTTGTTCTGAATAACCTCAGGGATTCCACCAATCTGAGTGGCCACTACAGGCACA 221 

CCGGAGACTGACGCTTCAGCCAGTACCATACCAAACGCTTCATTTTCCGAAGGCATGACC 4 97 

I I II Mill Mill II I I I I I I I M I III I M M M II I I I I II 
GAAAATGCCGATGCTTCTGCCAGCACCATACCAAAAGATTCGTTTTCTGAAGGCAGAACC 215 

ACCACACTGGCAATCCGGTAGACCGG 523 
I I I I M I M I I I ill I I II II 
ACCAGACTGGCAACCCGATATACGGG 212 8 



Query 


318 


Sbjct 


2333 


Query 


378 


Sbjct 


2273 


Query 


438 


Sbjct 


2213 


Query 


498 


Sbjct 


2153 
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>emb | CR942285 . 1 | EH Escherichia coli plasmid pCoo 
Length=90396 

Score = 111 bits (122), Expect = 9e-21 
Identities = 148/206 (71%), Gaps = 0/206 (0%) 
Strand-Plus/Plus 



Query 318 AAAAAATCGCGCAGTGCGCCGGTCCATGCAGACACATCCCCCACGGGTAACAGCGTCCCT 37 

I I I I I I I I Ml II II I II I I I I ! II II I I I I I II I I I I I I 
Sbjct 34153 AAAAAATCATTCAGGGCGCACATCCATGCGTGCTTATTACCTGCTGGTAACAATGTCCCG 34 

Query 378 GTCACATTCTTCTGAATGACATCAGGGATCCCGCCCGTCTCACTGGCGATAACGGGCACG 4 3 

III I I I I I I I I II I I I I I I I I I I i I I I III I MM I I I I I I I I 
Sbjct 34213 GTCTGGTTGTTCTGAATAACCTCAGGGATTCCACCAATCTGAGTGGCCACTACAGGCACA 34 

Query 4 38 CCGGAGACTGACGCTTCAGCCAGTACCATACCAAACGCTTCATTTTCCGAAGGCATGACC 4 9 

I I II I I I I I Mill I II I I II I I I I I III Mill II I I II I III 
Sbjct 34 273 GAAAATGCCGATGCTTCTGCCAGCACCATACCAAAAGATTCGTTTTCTGAAGGCAGAACC 34 

Query 498 ACCACACTGGCAATCCGGTAGACCGG 523 

I I II I I I II II I III M II II 
Sbjct 34333 ACCAGACTGGCAACCCGATATACGGG 34358 



>gb|CPO00800.1| BH Escherichia coli E24377A, complete genome 
Length=4979619 

Features in this part of subject sequence: 

putative glycosyl transferase/ group 1 family protein 

Score = 107 bits (118), Expect - le-19 
Identities - 201/290 (69%), Gaps - 4/290 (1%) 
Strand^Plus/Plus 



Query 


236 


Sbjct 


4845818 


Query 


295 


Sbjct 


4845877 


Query 


354 


Sbjct 


4845936 


Query 


414 


Sbjct 


4845996 


Query 


474 


Sbjct 


4846056 



TGCCGTTCTGTTAATATCAAACCGGTACTCAATATCTTCTCTGGCGCTGGCT-GCCATCA 
I I I I I I I M I I I II I II II II I I I I I I I I I I I I I I II I I II II I 
TGCAGTTTTATTAATATCAAACCGCTCTTCTATATCCTGTTTTGC-CAGGCGAGCCATCT 

TCCGGAAGCGTTCCGG-TCGGGATAAAAAATCGCGCAGTGCGCCGGTCCATGCAGACACA 
I I II Mill I I II I II M II I II II II II I I I I I 
GATAAAAACGCCCAGGGTCATTA-AAAAAATCATTCAGGGCGCACATCCATGCGTGCTTA 

TCCCCCACGGGTAACAGCGTCCCTGTCACATTCTTCTGAATGACATCAGGGATCCCGCCC 
I I I I I I II II I I II I II I I I I II I I II I I I II I II I II II I I 
TTACTTGCTGGTAACAATGTCCCGGTCTGGTTGTTCTGAATAACCTCAGGGATTCCACCA 

GTCTCACTGGCGATAACGGGCACGCCGGAGACTGACGCTTCAGCCAGTACCATACCAAAC 

I II I I I II I I I I II II I f I I II I II I II II I I II I II I II I 

ATCTGAGTGGCCACTACAGGCACAGAAAATGCCGATGCTTCTGCCAGCACCATACCAAAA 

GCTTCATTTTCCGAAGGCATGACCACCACACTGGCAATCCGGTAGACCGG 523 
I III I I II I I II I I I I I M II II I II II II I III II II II 
GATTCGTTTTCTGAAGGCAGAACCACCAGACTGGCAACCCGATATACGGG 4 84 6105 



>gb|AC133908 . 7 I 13 Mus musculus chromosome 5, clone RP24-299L9, complete seque 
Length=214270 

Score. = 46.4 bits (50), Expect - 0.30 
Identities = 28/30 (93%), Gaps = 0/30 (0%) 
St rand=Plus /Minus 

Query 4 CAGGAGATGGAAAAAAAGCCAAAATAAAAA 33 

I M I II I II M I I II II I I II II I II II 
Sbjct 1854 62 CAGGAGATGGAAAAAAAGCTGAAATAAAAA 185433 



>gb| AC127327 . 4 | 13 Mus musculus BAG clone RP23-254M18 from 5, complete sequenc 
Length=198433 

Score = 46.4 bits (50), Expect = 0.30 
Identities = 28/30 (93%), Gaps = 0/30 (0%) 
Strand=Plus /Minus 

Query 4 CAGGAGATGGAAAAAAAGCCAAAATAAAAA 33 

I I I I II II I II II I I I II I I I I II II I I 
Sbjct 92692 CAGGAGATGGAAAAAAAGCTGAAATAAAAA 92663 
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>gb|EU025714,l| 13 Salmo salar retinoic acid receptor gamma a (Rarga) , coiled- 

transcriptional coactivator a <Kiaal536) , homeobox protein 

HoxC13aa (HoxC13aa) , homeobox protein HoxC12aa (HoxC12aa), 

homeobox protein HoxCllaa (HoxCliaa) , homeobox protein HoxClOaa 

(HoxClOaa), homeobox protein HoxC9aa (HoxC9aa) , homeobox 

protein HoxC8aa {HoxC8aa) , and homeobox protein HoxC6aa (HoxC6aa) 

genes, complete cds 

Length-212839 

Features in this part of subject sequence: 
retinoic acid receptor gamma a 

Score = 44.6 bits (48), Expect = 1.1 
Identities = 27/29 (93%), Gaps - 0/29 (0%) 
Strand=Plus/Plus 

Query 1077 AGTCT GAGG AT GG AAGG AAGGT G AAGG C T 1105 

III I I I I I I I I I i I! I I II I I I I I I I I 
Sbjct 31660 AGTATGAGGATGGAAGGAAGGTGAGGGCT 31688 

>emb | BX571861.il EI Photorhabdus luminescens subsp. laumondii TTOl complete ge: 
segment 3/17 
Length=34 2905 

Features in this part of subject sequence: 
unnamed protein product 

Score = 44.6 bits (48), Expect =1.1 
Identities = 24/24 (100%), Gaps = 0/24 (0%) 
Strand^Plus /Minus 

Query 20 ■ • • - :•• - * ..TTGCCCATC 4 3 

I I I I I I I I I I I I I I I I I I I I II I I 
Sbjct 27988 AGCCAAAATAAAAAATTGCCCATC 27965 

>gb|DQ192243.1| Operophtera brumata reovirus segment 9, complete sequence 
Length=154 7 

GENE ID: 5076725 OBRV_s9gpl I hypothetical protein 
[Operophtera brumata reovirus] 

Score = 42.8 bits (46), Expect - 3.7 
Identities = 26/28 (92%), Gaps = 0/28 (0%) 
Strand=Plus /Minus 

Query 24 7 TAATATCAAACCGGTACTCAATATCTTC 27 4 

I I I I M I I I i I I I I I I I I I I I I I I I I 
Sbjct 2 60 TAATATCAAACCGGTATTCAATACCTTC 233 

>gb|AC145866.3 I 0 Pan troglodytes BAC clone RP43-21B7 from chromosome 7, comp 

sequence 

Length=189814 

Score = 42.8 bits (46), Expect = 3.7 
Identities - 28/31 (90%), Gaps = 0/31 (0%) 
Strand=Plus/Plus 

Query 366 AACAGCGTCCCTGTCACATTCTTCTGAATGA 3 96 

I I I I I I I I I I I I I I I I I I I I I I I i I I I I 
Sbjct 137128 AACAGAGTCCCTTTCACATCCTTCTGAATGA 137158 

>gb| AE015928 . 1 | EES Bacteroides thetaiotaomicron VPI-5482, complete genome 
t,ength=6260361 

Features in this part of subject sequence: 
His tone -like bacterial DNA-binding protein 

Score - 42.8 bits (46), Expect - 3.7 
Identities = 23/23 (100%), Gaps = 0/23 (0%) 
Strand=Plus/Plus 

Query 855 CCTTCCGCCATCAGATGCGCCAT 877 
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I I I I I i I I I I I I i I 1 1 I I I I II I 

Sbjct 6023752 CCTTCCGCCATCAGATGCGCCAT 6023774 

>gb|AC004844.1| BE Homo sapiens PAC clone RP4-613I23 from 7pll-pl3, complete 
Length=141895 

Score = 42.8 bits (46), Expect = 3.7 
Identities = 28/31 (90%), Gaps - 0/31 (0%) 
Strand=Plus/Plus 

Query 36 6 AACAGCGTCCCTGTCACATTCTTCTGAATGA 396 

I I I I I I II I I I I I I I I I I I II I I I I I I I 
Sbjct 110 633 AACAGAGTCCCTTTCACATCCTTCTGAATGA 110 663 



Database: All GenBank+EMBL+DDB J+PDB sequences (but no EST , STS, 
GSS, environmental samples or phase 0, 1 or 2 HTGS sequences) 
Posted date: Jul 20, 2008 5:48 PM 
Number of letters in database: -1,464,584,741 
Number of sequences in database: 7,064,549 

Lambda K H 

0.634 0.408 0.912 
Gapped 

Lambda K H 

0.634 0.408 0.912 
Matrix: blastn matrix:2 -3 
Gap Penalties: Existence: 5, Extension: 2 
Number of Sequences: 7064549 
Number of Hits to DB: 12224137 
Number of extensions: 604140 
Number of successful extensions: 8350 
Number of sequences better than 10: 0 
Number of HSP's better than 10 without gapping: 0 
Number of HSP's gapped: 8350 
Number of HSP's successfully gapped: 0 
Length of query: 1181 
Length of database: 24305219031 
Length adjustment: 36 
Effective length of query: 1145 
Effective length of database: 24050895267 
Effective search space: 27 538275080715 
Effective search space used: 27538275080715 
A: 0 

XI: 22 (20.1 bits) 
X2: 33 (29.8 bits) 
X3: 110 (99.2 bits) 
SI: 29 (27.4 bits) 
S2: 45 (41.9 bits) 
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